Generic Systolic Array for Run-Time Scalable Cores

نویسندگان

  • Andrés Otero
  • Yana Esteves Krasteva
  • Eduardo de la Torre
  • Teresa Riesgo
چکیده

This paper presents a scalable core architecture based on a generic systolic array. The size of this kind of cores can be adapted in real-time to cover changing application requirements or to the available area in a reconfigurable device. In this paper, the process of scaling the core is performed by the replication of a single processing element using run-time partial reconfiguration. Furthermore, rather than restricting the proposed solution to a given application, it is based on a generic systolic architecture which is adapted using a design flow which is also proposed. The paper includes a related work discussion, the proposal and definition of a systolic array communication approach, which does not require the use of specific macro structures and permits to achieve higher flexibility, and a design flow used to adapt the generic architecture. Further, the paper also includes an image filter application as a simple use case, along with implementation results for Virtex 5 FPGA.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

FPGA-Based Configurable Systolic Architecture for Window-Based Image Processing

Image processing requires more computational power and data throughput than most conventional processors can provide. Designing specific hardware can improve execution time and achieve better performance per unit of silicon area. A fieldprogrammable-gate-array(FPGA-) based configurable systolic architecture specially tailored for real-time window-based image operations is presented in this pape...

متن کامل

Enhancing FPGA Robustness via Generic Monitoring IP Cores

Today, state of the art technology allows a very dense integration of embedded HW/SW designs. As a consequence, more errors are introduced in these circuits that have to be observed during run-time. Adding monitors to a design enables the recognition of and the reaction to these threats, but, usually, monitors have to be developed for every individual FPGA design. Our approach provides generic ...

متن کامل

Task Scheduling for Many-Cores with S-NUCA Caches

A many-core processor may comprise a large number of processing cores on a single chip. The many-core’s last-level shared cache can potentially be physically distributed alongside the cores in the form of cache banks connected through a Network on Chip (NoC). Static Non-Uniform Cache Access (SNUCA) memory address mapping policy provides a scalable mechanism for providing the cores quick access ...

متن کامل

FPGA Implementation of a Scalable and Run-Time Adaptable Multi-Standard Packet Detector

This paper describes a step by step approach for implementing a scalable and run-time adaptable multi-standard packet detector for orthogonal frequency divisional multiplexing (OFDM) based communication standards. The paper briefly describes considerations and design choices in making a modular system block with generic control supporting rapid prototyping and implementation of preamble-based p...

متن کامل

Generic Program Representation and Evaluation of Systolic Computations on Multicomputers

An overview of the uniied model (proposed earlier by us) for compiling systolic computations for multicomputers is given. Subsequently, we consider the diagonal and farming implementationalobjects which represent two speciic cases of the implementational objects deened in the model. For these two objects, we give prototype codes in the Occam language and give performance models for multi-transp...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010